CPG-Actor: Reinforcement Learning for Central Pattern Generators
Abstract
Central Pattern Generators (CPGs) have several properties desirable for locomotion: they generate smooth trajectories, are robust to perturbations and are simple to implement. However, they are notoriously difficult to tune and commonly operate in an open-loop manner. This paper proposes a new methodology that allows tuning CPG controllers through gradient-based optimisation in a Reinforcement Learning (RL) setting. In particular, we show how CPGs can directly be integrated as the Actor in an Actor-Critic formulation. Additionally, we demonstrate how this change permits us to integrate highly non-linear feedback from sensory perception to reshape the oscillators' dynamics. Our results on a locomotion task using a single-leg hopper demonstrate that explicitly using the CPG as the Actor, rather than as part of the environment, results in a significant increase in the reward gained over time (20× more) compared with previous approaches. Finally, we demonstrate how our closed-loop CPG progressively improves the hopping behaviour for longer training epochs, relying only on basic reward functions.
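The abstract does not say which oscillator model underlies the CPG, so the following is only an illustrative sketch of what "reshaping an oscillator's dynamics with feedback" can look like. It assumes a Hopf oscillator, a common CPG building block, integrated with a simple Euler step. The names `hopf_step` and `rollout`, the parameters `mu`, `omega`, `alpha`, `dt`, and the `feedback` term are all hypothetical and are not taken from the paper.

```python
import math


def hopf_step(x, y, mu=1.0, omega=2.0 * math.pi, alpha=10.0, dt=0.001, feedback=0.0):
    """Advance an assumed Hopf oscillator by one Euler step.

    mu       -- squared amplitude of the limit cycle (assumption)
    omega    -- angular frequency in rad/s (assumption)
    alpha    -- convergence rate towards the limit cycle (assumption)
    feedback -- additive term standing in for sensory feedback (assumption)
    """
    r2 = x * x + y * y
    dx = alpha * (mu - r2) * x - omega * y + feedback
    dy = alpha * (mu - r2) * y + omega * x
    return x + dt * dx, y + dt * dy


def rollout(steps, x0=0.1, y0=0.0, feedback_fn=None, **params):
    """Integrate the oscillator for `steps` steps and return the (x, y) trajectory.

    feedback_fn, if given, maps the current state (x, y) to a scalar feedback
    value; with None the oscillator runs open-loop.
    """
    x, y = x0, y0
    trajectory = []
    for _ in range(steps):
        fb = feedback_fn(x, y) if feedback_fn is not None else 0.0
        x, y = hopf_step(x, y, feedback=fb, **params)
        trajectory.append((x, y))
    return trajectory


if __name__ == "__main__":
    traj = rollout(20000)
    x, y = traj[-1]
    # Open-loop, the state settles near a circle of radius sqrt(mu).
    print("final amplitude:", math.hypot(x, y))
```

In an Actor-Critic setting such as the one the abstract describes, quantities like `mu`, `omega` and the feedback mapping would be the parameters adjusted by gradient-based optimisation. This sketch shows only the oscillator itself, not the learning procedure.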
Similar resources
Central pattern generators
What are they? Central pattern generators (CPGs) are relatively small, relatively autonomous groups of neurons (neural networks) that produce patterned, rhythmic neural outputs that drive rhythmic behaviours. In addition to generating boring behaviours like walking, CPGs are also responsible for dancing, chewing, swallowing, suckling, copulation and orgasm — all the things that make life worthw...
Reinforcement Learning for CPG-Driven Biped Robot
Animals' rhythmic movements such as locomotion are considered to be controlled by neural circuits called central pattern generators (CPGs). This article presents a reinforcement learning (RL) method for a CPG controller, which is inspired by the control mechanism of animals. Because the CPG controller is an instance of recurrent neural networks, a naive application of RL involves difficulties. ...
Distributed Online Learning of Central Pattern Generators in Modular Robots
In this paper we study distributed online learning of locomotion gaits for modular robots. The learning is based on a stochastic approximation method, SPSA, which optimizes the parameters of coupled oscillators used to generate periodic actuation patterns. The strategy is implemented in a distributed fashion, based on a globally shared reward signal, but otherwise utilizing local communication ...
Central pattern generators for bipedal locomotion.
Golubitsky, Stewart, Buono and Collins proposed two models for the architecture of central pattern generators (CPGs): one for bipeds (which we call leg) and one for quadrupeds (which we call quad). In this paper we use symmetry techniques to classify the possible spatiotemporal symmetries of periodic solutions that can exist in leg (there are 10 nontrivial types) and we explore the possibility t...
Commentary/Selverston: Central pattern generators
and ion channels must be functionally described in order to obtain a full understanding of a CPG, we will not have a detailed, mechanistic explanation for some considerable length of time. A complete compilation of the detailed molecular biophysics of neurons will long remain the "quark" of cellular and integrative neurobiology. The most disturbing feature of the Selverston paper is its pessimi...
Journal
Journal title: Lecture Notes in Computer Science
Year: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-89177-0_3